Benchmarking vision kernels and neural network inference accelerators on embedded platforms

نویسندگان

چکیده

Developing efficient embedded vision applications requires exploring various algorithmic optimization trade-offs and a broad spectrum of hardware architecture choices. This makes navigating the solution space finding design points with optimal performance challenge for developers. To help provide fair baseline comparison, we conducted comprehensive benchmarks accuracy, run-time, energy efficiency wide range kernels neural networks on multiple platforms: ARM57 CPU, Nvidia Jetson TX2 GPU Xilinx ZCU102 FPGA. Each platform utilizes their optimized libraries (OpenCV, VisionWorks xfOpenCV) (OpenCV DNN, TensorRT DPU). For kernels, our results show that achieves an energy/frame reduction ratio 1.1–3.2× compared to others simple kernels. However, more complicated complete pipelines, FPGA outperforms ratios 1.2–22.3×. [Inception-v2 ResNet-50, ResNet-18, Mobilenet-v2 SqueezeNet], it shows speed up [2.5, 2.1, 2.6, 2.9 2.5]× EDP [1.5, 1.1, 1.4, 2.4 1.7]× FP16 implementations, respectively.

برای دانلود باید عضویت طلایی داشته باشید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Benchmarking Methodology for Embedded Scalable Platforms

Embedded scalable platforms (ESP) are a novel generation of platform architectures that yield optimal energy-performance operations while supporting a diversity of embedded application workloads. A companion methodology combines full-system simulation, pre-designed HW/SW interface libraries, high-level synthesis and FPGA prototyping to enable an effective design-space exploration which is drive...

متن کامل

A Preliminarily Study on Embedded Platforms for Computer Vision Applications

Today, silicon chips have become more and more powerful despite reduction in size. Various general processors, digital signal processors (DSPs) and graphic processing units (GPUs) have been embedded in countless electronic gadgets such as Personal Digital Assistants (PDAs), mobile hand-phone, digital cameras etc. With the high processing power of these embedded platforms and ever increasing siz...

متن کامل

Benchmarking Probabilistic Neural Network Algorithms

The progress of research in probabilistic neural network (PNN) and related issues is straight related to directly compare the performance of different PNN algorithm versions. In most cases, the PNN application in real life issues involves the classical activation function (Parzen-Cacoulos estimator) only. Although this estimator has been used in most experimental works so far, it is not the onl...

متن کامل

Benchmarking CPUs and GPUs on Embedded Platforms for Software Receiver Usage

Smartphones containing multi-core central processing units (CPUs) and powerful many-core graphics processing units (GPUs) bring supercomputing technology into your pocket (or into our embedded devices). This can be exploited to produce power-efficient, customized receivers with flexible correlation schemes and more advanced positioning techniques. For example, promising techniques such as the D...

متن کامل

Design, Development and Evaluation of an Orange Sorter Based on Machine Vision and Artificial Neural Network Techniques

ABSTRACT- The high production of orange fruit in Iran calls for quality sorting of this product as a requirement for entering global markets. This study was devoted to the development of an automatic fruit sorter based on size. The hardware consisted of two units. An image acquisition apparatus equipped with a camera, a robotic arm and controller circuits. The second unit consisted of a robotic...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: Journal of Systems Architecture

سال: 2021

ISSN: ['1383-7621', '1873-6165']

DOI: https://doi.org/10.1016/j.sysarc.2020.101896